A New Approach in Computer Representation of Bangla Words and Bangla Sorting Algorithm
نویسندگان
چکیده
Development of Bangla based computer application is relatively complex due to the complexities of Bangla character set (for example computer representation of composite letters). This paper focuses on a new technique on internal representation of Bangla words in computer system along with a Bangla word sorting algorithm using that representation. Here, we propose a special technique which converts a Bangla word into a unique real number. Now, if the numbers corresponding to a given set of Bangla words are sorted using any of the familiar sorting algorithms then we get the sorted order of the words in that set which is simply the sorted order of the numbers that represents words. Our algorithm compares real numbers rather than characters to sort the words and thus decreases the difficulties of character comparing which exists in many of the current Bangla sorting algorithm.
منابع مشابه
A New Approach to Bangla Text Extraction and Recognition From Textual Image
This paper presents a new approach to segment and recognize Printed Bangla Text using Characteristic functions and Hamming network. The main difficulties in printed Bangla text recognition are the separation of lines, words and individual characters. In this paper, a new algorithm has been proposed to detect and separate text lines, words and characters from printed Bangla text. The algorithm u...
متن کاملPhonetic Bengali Input Method for Computer and Mobile Devices
Current mobile devices do not support Bangla (or Bengali) Input method. Due to this many Bangla language speakers have to write Bangla in mobile phone using English alphabets. During this time they used to write English foreign words using English spelling. This tendency also exists when writing in computer using phonetically input methods, which cause many typing mistakes. In this scenario, co...
متن کاملA Bangla Phonetic Encoding for Better Spelling Suggestions
We present a phonetic encoding for Bangla that can be used by spelling checkers to provide better suggestions for misspelled words. The encoding is based on the Soundex algorithm, modified to match Bangla phonetics. We start by analyzing Soundex encoding scheme when applied to Bangla. &ext we propose a new encoding that handles the case of Bangla words, including those containing conjuncts. We ...
متن کاملSeparating Words from Continuous Bangla Speech T
In this paper we present a new word separation algorithm for Real Time Speech i.e., Continuous Bangla Speech Recognition (CBSR). Prosody has great impact on Bangla speech and the algorithm is developed by considering prosodic feature with energy. Task of this algorithm is to separate Bangla speech into words. At first continuous Bangla speech are fed into the system and the word separation algo...
متن کاملBlocking Black Area Method for Speech Segmentation
Speech segmentation is an important sub problem of automatic speech recognition. This research is concerned with the development of a continuous speech segmentation system using Bangla Language. This paper presents a dynamic thresholding algorithm to segment the continuous Bngla speech sentences into words/sub-words. The research uses Otsu’s method for dynamic thresholding and introduces a new ...
متن کامل